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A general and basic model of primordial evolution — a soup of reacting finitary and discrete 
processes — is employed to identify and analyze fundamental mechanisms that generate and maintain 
complex structures in prebiotic systems. The processes — e-machines as defined in computational 
mechanics — and their interaction networks both provide well defined notions of structure. This 
enables us to quantitatively demonstrate hierarchical self-organization in the soup in terms of com- 
plexity. We found that replicating processes evolve the strategy of successively building higher levels 
of organization by autocatalysis. Moreover, this is facilitated by local components that have low 
structural complexity, but high generality. In effect, the finitary process soup spontaneously evolves 
a selection pressure that favors such components. In light of the finitary process soup's generality, 
these results suggest a fundamental law of hierarchical systems: global complexity requires local 
simplicity. 
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I. INTRODUCTION 



soup as a constructive, unrestricted dynamical system. 



The very earliest stages of evolution — or rather, pre- 
evolution — remain a mystery. How did structure emerge 
in a system of simple interacting objects, such as 
molecules? How was this structure commandeered as 
substrate for subsequent evolution — evolution that con- 
tinued to transform the objects themselves? One wonders 
if this recursive interplay between structure and dynam- 
ics facilitated the emergence of complex and functional 
organizations. Since these questions concern the most 
fundamental properties of evolutionary systems, we ex- 
plore them using principled and rigorous methods. 

To build a suitable model a few basic ingredients are 
required. First, one needs some type of elementary ob- 
jects that constitute the state of the system at its finest 
resolution. Second, one needs rules for how the objects 
interact. Third, one needs an environment in which the 
objects interact. Fourth, one needs quantitative and cal- 
culable notions of structure and organization. These re- 
quirements led us to the finitary process soup model of 
primordial evolution Simply stated, the soup's ingre- 
dients are, in order, e-machines, their functional compo- 
sition, a flow reactor, and the structural complexity C M 
of e-machines. 

After explaining each of these ingredients, we will re- 
late the model to classical replicator dynamics by reduc- 
ing the soup to a special case. We then move on to con- 
trast the limited case with the full-fledged finitary process 
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II. OBJECTS: e-MACHINES 

Here we employ a finite-memory process called an e- 
machine 0, 01, 0] , as our preferred representation of an 
evolving information-processing individual. Using a pop- 
ulation of e-machines is particularly appropriate in study- 
ing self-organization and evolution from an information- 
theoretic perspective as they allow quantitative measure- 
ments of storage capacity and randomness. Rather than 
using the abstraction of a formal language — an arbitrary 
finite set of finite length words — we consider a discrete- 
valued, discrete-time stationary stochastic process de- 
scribed by a bi-infinite sequence of random variables St 
over an alphabet A: 



S— ...SiSqSi. 



(1) 



A process stores information in a set of causal states 
that are equivalence classes of semi-infinite histories that 
condition the same probability distribution for future 
states. More formally, the causal states S of a process 

are the members of the range of the map e : S 2 s 
from histories to sets of histories: 



e(s) = {*s |P(5 | S= s) = P(5 | S= V )} 



(2) 
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where 2 s denotes the power set of S- Further, let S E S 

be the current casual state, S' its successor, and S the 
next symbol in the sequence ([I]). The transition from 
one causal state Si to another Sj that emits the symbol 
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FIG. 1: Three examples of e-machines. Ta represents the 
identity function and has the causal state A. Tb has two 
causal states (A and B), accepts the input string 1010 ... or 
0101 . . ., and operates by flipping 0s to Is and vice versa. To 
has the same domain and range as Tb , but maps input strings 
onto themselves. 



s G A is given by a set of labeled transition matrices: 
T = {T// } :seA}, where 



T, 



(3) 



The e-machine of a process is the ordered pair {S,T}. 
One can show that it is the minimal, maximally predic- 
tive causal representation of the process Unlike a 
general probabilistic e-machine, for simplicity, here we 
take causal-state transitions to have equal probabilities. 
The finitary e-machines can be thought of as finite-state 
machines with a certain properties [J]: (1) All states are 
start states and accepting states; (2) All recurrent states 
form a single strongly connected component; (3) All tran- 
sitions are deterministic: A causal state together with the 
next value observed from the process determines a unique 
next causal state; And (4) the set of causal states is min- 
imal. Here we use an alphabet of input and output pairs 
over a binary alphabet: A = {0|0, 0|1, 1|0, 1|1}. This im- 
plies that the e-machines work as mappings between sets 
of strings. In other words, they are transducers [f|. 

In contrast to prior models of pre-biotic evolution, e- 
machines are simply finitely-specified mappings. More to 
the point, they do not have two separate modes of rep- 
resentation (information storage) or functioning (trans- 
formation). The advantage is that there is no assumed 
distinction between gene and protein 0, 0] or between 
data and program (8|, H, Hi! EH- Instead, one recovers 
the dichotomy by projecting onto (i) the sets that an e- 
machine recognizes and generates and (ii) the mapping 
between these sets. Examples of e-machines are shown 
in Figure [TJ 



III. INTERACTION: FUNCTIONAL 
COMPOSITION 

The basic pairwise interaction we use in the finitary 
process soup is functional composition. Two machines 
interact and produce a third machine — their composi- 
tion. Composition is not a symmetric operation. Ma- 
chine Ta composed with another Tb does not necessarily 
result in the same machine as Tb composed with Ta'- 
T b oT a ^T a ° T B - 



The upper bound on the number of states of the com- 
position is the product of the number of states of the 
parents: \Tb ° T A \ < \Tb\ X \T~a\- Hence, there is the 
possibility of exponential growth of states and machine 
complexity if machines are iteratively composed. A com- 
position, though, may also result in a machine with lower 
complexity than those of its parents. 



A. Interaction networks 

We represent the interactions among a set of e- 
machines with an interaction network Q which is a a 
graph whose nodes correspond to e-machines and whose 
transitions correspond to interactions. If T^ — Tj o Tj 
occurs in the soup, then the edge from T; to Tk is labeled 



Tj. One may represent Q with the binary matrices: 



,(fc) 



1 if T fc = Tj o Ti 
otherwise. 



(4) 



Consider the e-machines in Fig. [TJ for example. They 
are related via composition according to the interaction 
graph shown in Fig. O which is given by the matrices 
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and 



7(C) 



1 





1 

1 1 
1 



1 

1 

1 1 



(5) 



(6) 



(7) 



B. Meta- machines 

For a machine to survive in its environment somehow 
it needs to produce copies of itself. This can be done di- 
rectly by self-reproduction, e.g. Ta °Ta — Ta, or it can 
be done indirectly in cooperation with other machines: 
e.g., Ta facilitates the production of Tb, which facilitates 
the production of Tc, which, in turn, closes the loop by 
facilitating the production of Ta- In other words, there 
can be sets of machines that interact with each other in 
such a way that they collectively self-reinforce the over- 
all production of the set. This leads to the notion of an 
autonomous and self-replicating entity, which we call a 
meta-machine. Inspired by Maturana and Varela's au- 
topoietic set [12], Eigen and Schuster's hypercycle [l3l |. 
and Fontana and Buss' organization [l4j, we define a 
meta-machine O to be a connected set of e-machines 
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FIG. 2: Interaction network of the e-machines in Fig. \T\ 
There is a transition, for example, that is labeled Tc from 
the node Ta to the node Tc, since Ta composed with Tc 
results in Tc (in fact, each transition from Ta has the same 
label as the label of its respective sink node since Ta is the 
identity function). 



whose interaction matrix consists of all and only the 
members of the set. That is, a set S7 is a meta-machine 
if and only if (1) the composition of two e-machines from 
the set is always itself a member of the set: 

TjoTiGSl, VT; ,Tj £ Q ; (8) 

(2) all e-machines in the set can arise from the composi- 
tion of two machines in the set: 

BT^TjeQ, T k =T j0 T h VT fe eQ; (9) 

and (3) there is a nondirected path between every pair of 
nodes in Si's interaction network Qq. The third property 
ensures that there is no subset of SI that is isolated from 
the rest of S7 under composition. Consider, for example, 
the union of two self-replicators, Ta and Tg, for which 
TboTa = Ta°Tb = Tq. According to property (3), they 
are not a meta-machine. 



IV. COMPLEXITY MEASURES: C M 

In previous computational pre-biotic models, the ob- 
jects have been represented by, for example, assembly 
language codes [1, H, [TH fTll | ta gs [TBI . [lrij , A-expressions 
[]~7l ] and cellular automata [Hj]. We employ e-machines 
instead mainly for one reason: there is a well devel- 
oped theory (computational mechanics) of their struc- 
tural properties. Assembly language programs and A- 
expressions, for instance, are computational universal 
representations and so one knows that it is not possible 
to calculate their complexity 

For finitary e-machines, in contrast, complexity can be 
readily defined and analytically calculated in closed form. 
Define the stochastic connection matrix of an e-machine 
M = {S,T} as T = Y,seA T(s) - Thc probability dis- 
tribution ps over the states in S — how often they are 
visited — is given by the normalized left eigenvector of T 
associated with eigenvalue 1. 




FIG. 3: A schematic illustration of the finitary process soup. 
Two e-machines, Ta and Tb, are composed and produce a 
third machine Tc, or a random machine Tr is introduced to 
the soup. In either case, another randomly selected machine 
To is removed to maintain a fixed population size. Note that 
this is a well stirred setting, and so there is no spatial rela- 
tionship in the population. 



The structural complexity of M is the Shannon en- 
tropy of the distribution given by ps , 

The structural complexity of an e-machine is the amount 
of information stored in the distribution over S, which 
is the minimum average amount of memory needed to 
optimally predict future configurations 0]. 

To measure the diversity of interactions in the soup we 
define the interaction network complexity C^iff) to be the 
Shannon entropy of the distribution of effective transition 
probabilities in the graph Q. We consider, in particular, 
only the transitions that have occurred between machine 
types that are present. That is, 

where 

v k _ I PiPj/ J2 v ij if T k = T j Ti has occurred 
13 1 otherwise. 

and pi is the fraction of machines of type i in the popula- 
tion. To monitor the emergence of actual and functional 
reproduction paths, we consider only those interactions 
that occurred in the population. 

V. FRAMEWORK: THE SOUP 

The e-machines interact in a well stirred reactor with 
the following iterated dynamics: 

1. Production and influx: 



4 



(a) With probability <E>i n generate a random e- 
machine T^. 

(b) With probability 1 — <&i„ (reaction): 

i. Select Ta and Tb randomly. 

ii. Form the composition Tq = Tb ° Ta- 

2. Out flux: 

(a) Select an e-machine To randomly from the 
population. 

(b) Replace To with the e-machine produced in 
the previous step — either T c or T R . 

Tr is uniformly sampled from the set of all two-state 
e-machines in our simulations (see below). This sampling 
is also used when initializing the population. The inser- 
tion of Tr corresponds to an influx while the removal 
of Td corresponds to an outflux. The latter keeps the 
population size constant. See Fig. [3] for a schematic il- 
lustration. There is no spatial dependence in this version 
of the soup as e-machines are sampled uniformly from 
the population for each replication and removal. 

VI. CLOSED POPULATION DYNAMICS 

To familiarize ourselves with the model we first exam- 
ine a simple base case: a soup with no influx that is 
initialized with machines taken from a finite set which 
is closed under composition. This case is also intended 
to work as a bridge between classical population dynam- 
ics and the general, constructive dynamics of the Unitary 
process soup. The closure with respect to composition 
enables us to describe the system's temporal dynamics 
of e-machine concentrations by a coupled system of or- 
dinary differential equations. In the limit of an infinite 
soup size, the rate equation of concentration pk of ma- 
chine type Tk is given by 

Pk = ipk - ®outPk, k=l,...,n, (13) 

where the conditional production rate ipk is the probabil- 
ity that Tk is produced given that two randomly sampled 
machines are paired: 

n 

ijj k = 53 a ^p*Pj> ( i4 ) 

ij = l 

and a^A is a second-order reaction rate constant: 

13 1 otherwise. 

The outflux <E> out equals the total production rate of the 
soup — i.e., the probability that a reaction occurs given 
that two e-machines are paired. It keeps the size of the 
soup constant: 

n 
i=l 
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FIG. 4: A simple base case: Machine type frequencies of 
mono-machines as functions of time. N (= 100, 000) denotes 
the population size. Dashed lines: simulation; solid lines: Eq. 



Given a soup with no influx, <!% = 0, that hosts ma- 
chines which are members of a set that is closed under 
composition, the distribution dynamics can alternatively 
be predicted from by its interaction network: 

pW = Pt _ 1 .^.p?l 1 Z- 1 , (17) 

(k) 

where p t is the frequency of e-machine type k at time 
t and Z~ l is a normalization factor. This approximates 
the soup's elements as updating synchronously. 

We illustrate the closed case by initiating the soup 
with machines that consist of only a single state — 
mono-machines. There are 15 mono-machines, the null 
(transition-less) transducer is excluded, and they form a 
closed set under composition. See Fig. [4]for the temporal 
dynamics of their respective frequencies. Nine machine 
types remain in the population at equilibrium. They 
form a meta-machine M. with C )Jl (Ai) = 5.75 bits. In 
this case, since C^Tj) = for all mono-machines T i: the 
population's structural complexity derives only from its 
interaction network. 



VII. OPEN POPULATION DYNAMICS 

We now move on to the general case of a soup with 
positive influx rate consisting of e-machines of arbitrary 
size. The soup then constitutes a constructive dynami- 
cal system where there is a mutual dependence between 
its equations of motion and the individuals. Due to the 
openness, Eqs. (fT3)) - ([T7| do not necessarily apply. We 
therefore turn to simulations. 

In order to study dynamics that is ruled solely by com- 
positional transformations we first set the influx rate to 
zero. A fine-grained description of the soup's history on 
the e-machine level is given by a genealogy — a record of 
descent of machine types. By studying the example in 
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FIG. 5: Genealogy of e-machine types in a soup with 100 
machines. A solid line denotes that a machine type is present 
in the soup. Dashed lines (drawn from the parents to the 
child) denote composition. Note that almost the the whole 
set of initial e-machine types (with one exception) is replaced 
by the dynamics. 
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FIG. 6: Decomposition of meta-machines in a soup with no 
influx. Superimposed plots of C^Q) from 15 separate runs 
with N = 500. Cfj,(G) is bounded by 4 bits while a 4-element 
meta-machine (shown), denoted 04, is the largest one in the 
soup. Q.4 decays to ^2, a 2-element meta-machine (shown) 
due to fluctuations, that in turn decays to Hi, a single self- 
reproducing e-machine. 



Fig. [5l a simulation with N = 100 individuals, one im- 
portant observation is that nearly all the e-machine types 
that are present in the soup's initial population are re- 
placed over time. Thus, genuine novelty emerges, in con- 
trast to the closed soup just described. Initially, there 
is a rapid innovation phase in which novel machines are 
introduced that displace the bulk of the initial machines. 
The degree of innovation flattens out, along with the di- 
versity of the soup, and eventually vanishes as the popu- 
lation becomes increasingly closed under composition. 

To monitor the soup's organization over time, we su- 
perimpose C M (C?) time series from several runs in Fig. 
23 One sees that plateaus are formed. These can be ex- 
plained in terms of meta-machines. In addition to captur- 
ing the notion of self-replicating entities, meta-machines 
also describe an invariant set of the population dynamics. 
That is, formally, 

n = GoQ, (18) 

where fl is the set of e-machines present in the population 
and Q is their interaction network. These invariant sets 
can be stable or unstable under the population dynamics. 

Consider, for example, the meta-machine in Fig. O It 
is unstable, since Tas are only produced by T^s, and will 
decay over time to the meta-machine of Fig. [7J This 
also illustrates, by the way, how trivial self-replication is 
spontaneously attenuated in the soup. 

The plateaus at C^(Q) — 4 bits, C M (C?) = 2 bits, and 
C/j,(G) = bits correspond to the largest meta-machine 
that is present at that time. Since a meta-machine by 
definition is closed under composition, it itself does not 
produce novel machines; thus, one has the upper bound 
of C M (C?). As a meta-machine is reduced due to an inter- 
nal instability or sampling fluctuations by the outflux, 
the upper bound of C^Q) is lowered. This results in 




FIG. 7: The resulting meta-machine when the meta-machine 
in Fig. [5] decays under the population dynamics of Eq. (|17p . 



a stepwise and irreversible succession of meta-machine 
decompositions. Fig. [6] shows only three plateaus. In 
principle, however, there is one plateau for every meta- 
machine that at some point is the largest one in the pop- 
ulation. The diagram in Fig. [5] summarizes our results 
from a more systematic survey of spontaneously gener- 
ated meta-machine hierarchies in simulations of soups 
with 500 e-machines. 

We now examine the effects of influx by studying the 
population-averaged e-machine complexity (C M (T)) and 
the run-averaged interaction network complexity {C^Q)) 
as a function of t and <&i„, see Fig. |H1 

The average e-machine complexity (C M (T)) increases 
rapidly initially before declining to a steady state. The 
average interaction network complexity {C^{Q)) is rela- 
tively high where the average structural complexity of the 
e-machines is low, and is maximized at $i„ rs 0.1. Higher 
influx rates have a destructive effect on the populations' 
interaction network due to the new individuals' low re- 
production rate. (C M (T)) is, in contrast, maximized at a 
relatively high influx rate ($j„ « 0.75) at which (0^(0)) 
is relatively small. The maximum network complexity 
C/j,(Q) of the population grows linearly at a positive rate 
of approximately 7.6 • 10~ 4 bits per replication. 
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FIG. 8: Composition and decomposition hierarchy of meta- 
machines. Dots denote self-replicating e-machines, solid lines 

denote Ta Tc transitions and dashed lines denote equiv- 
alent Tb Tc transitions. The label of the source node 
and the transition label are interchanged in the latter transi- 
tion type. This results in a redundant representation of the 
interaction network, which is used to show how the meta- 
machines are related. The interaction networks are shown in 
a simplified way according to Q4; cf. Fig. [6] 



VIII. DISCUSSION 

We presented a conceptual model of pre-biotic evolu- 
tion: a soup consisting of objects that make new objects 
[l|. The objects are e-machines and they generate new 
e-machines by functional composition. The soup consti- 
tutes a constructive dynamical system since the popula- 
tion dynamics is not fixed and may itself evolve along 
with the state space it operates on. Specifically, the di- 
mension of the state space changes over time, which is 
reminiscent of the constructive population dynamics as- 
sociated with punctuated equilibria [l9[ . 

In principle, this allows for open-ended evolution. The 
quantitative estimate quoted above for the linear growth 
of the interaction network complexity supports this in- 
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FIG. 9: (a) Population- and run-averaged e-machine complex- 
ity (C M (T)) and (b) run-averaged interaction network com- 
plexity (C M (C/)) as a function of time t and influx rate <&i„ for 
a population of N — 100 objects. (Reprinted with permission 
from d). 



triguing possibility occurring in the open finitary pro- 
cess soup. In the case of no influx, though, the sys- 
tem reaches a steady state where the soup consists of 
only one self-replicator. Growth and maintenance of or- 
ganizational complexity requires that the system is dis- 
sipative; i.e., that there is a small, but steady inflow 
of random e-machines. Notably, in this case, the soup 
spontaneously evolves hierarchical organizations in the 
population — meta-machines that in turn are organized 
hierarchically. 

These hierarchies are assembled from noncomplex, 
general individual e-machines. In this way, the soup's 
emergent complexity derives largely from a network of 
interactions, rather than from the unbounded increase 
in the structural complexity of individuals. It appears, 
therefore, that higher-order complex organization not 
only allows for simple local components but, in fact, re- 
quires them. 
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